Current Pharmaceutical Design, 2006, 12, 4573-4590 


4573 


Drug Design Targeting the Main Protease, the Achilles' Heel of 


Coronaviruses 


Haitao Yang, Mark Bartlam and Zihe Rao* 


National Laboratory of Biomacromolecules, Institute of Biophysics (IBP), Chinese Academy of Sciences, Beijing 
100101, China and Tsinghua-IBP Joint Research Group for Structural Biology, Tsinghua University, Beijing 100084, 


China 


Abstract: Coronaviruses (CoVs), a genus containing about 26 known species to date, cause highly prevalent diseases and 
are often severe or fatal in humans and animals. In 2003, a previously unknown coronavirus was identified to be the 
etiological agent of a global outbreak of a form of life-threatening pneumonia called severe acute respiratory syndrome 
(SARS). No efficacious therapy is currently available, and vaccines and drugs are under development to prevent SARS- 


CoV infection in many countries. The CoV main protease (M! 


pro 


), which plays a pivotal role in viral gene expression and 


replication through a highly complex cascade involving the proteolytic processing of replicase polyproteins, is an attrac- 
tive target for drug design. This review summarizes the recent advances in biological and structural studies, together with 
development of inhibitors targeting CoV Ms. It is expected that inhibitors targeting CoV M?"°s could be developed into 
wide-spectrum antiviral drugs against existing and possible future emerging CoV-associated diseases. 
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1. INTRODUCTION 


Coronaviruses, characterized as enveloped, positive- 
stranded RNA viruses with the largest known genome, be- 
long to the genus Coronavirus of the family Coronaviridae 
[1, 2]. There are approximately 26 species of coronaviruses 
(CoVs) [3-6] which can be classified into three distinct 
groups according to their genome sequences and serological 
reactions [2]. They infect humans and multiple species of 
animals, causing a variety of highly prevalent and severe 
diseases. For example, human coronavirus (HCoV) strains 
229E (HCoV-229E), NL63 (HCoV-NL63), OC43 (HCoV- 
OC43), and HKU1 (HCoV-HKU1) are responsible for a 
significant portion of upper and lower respiratory tract in- 
fections in humans, including common colds, bronchiolitis, 
and pneumonia. They have also been implicated in otitis 
media, exacerbations of asthma, diarrhea, myocarditis, and 
neurological disease [1, 7-11]. In 2003, a previously un- 
known HCoV called severe acute respiratory syndrome 
coronavirus (SARS-CoV), the emergence of which was 
most likely as a result of animal-human transmission [12], 
was identified as the etiological agent of a global outbreak 
of a life-threatening form of pneumonia called severe acute 
respiratory syndrome (SARS) [13-16]. SARS ultimately 
infected more than 8000 people and killed approximately 
800 people worldwide, with about 10% mortality rate, be- 
fore it was effectively brought under control. From the be- 
ginning of the SARS epidemic, efforts to identify the natural 
host for SARS-CoV have never ceased. Recently two inde- 
pendent research groups have discovered a coronavirus 
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closely related to SARS-CoV in bats, which has prompted a 
new health alert [5, 6]. Animal coronaviruses, including 
avian infectious bronchitis virus (IBV), turkey coronavirus 
(TCV), porcine transmissible gastroenteritis virus (TGEV), 
porcine hemagglutinating encephalomyelitis virus (HEV), 
canine coronavirus (CCoV), feline infectious peritonitis 
virus (FIPV) and bovine coronavirus (BCV), are significant 
pathogens for chickens, turkeys, pigs, dogs, cats and cattle 
[2, 17]. Most of the known coronaviruses are highly infec- 
tious with high mortality in young animals, resulting in sig- 
nificant economic losses for the animal industry worldwide. 


Although vaccines against some animal coronaviruses, 
such as IBV and CCoV, are routinely used to prevent seri- 
ous diseases, there are several potential problems [1, 3, 11]. 
At present, there are no licensed vaccines or specific drugs 
are available to prevent HCoV infection [1, 11]. Due to lack 
of efficacious therapies, the mortality rate was high during 
the SARS outbreak. Consequently, great efforts have been 
focused on the development of vaccines and drugs against 
SARS-CoV. Viral-vectored vaccines [18-20] and DNA vac- 
cines [21-23] have been tested in animal models with suc- 
cessful results, and currently an inactivated SARS-CoV 
vaccine is in clinical trials in China [24]. However, safety 
remains the major concern. Drug development strategies are 
focused on two main avenues: inhibitors to block virus entry 
into the host cells, and compounds to prevent viral replica- 
tion and transcription. The CoV main protease (M?°), which 
plays a pivotal role in mediating viral replication and tran- 
scription, is a particularly attractive target for anti-SARS 
drug design [25, 26]. Several reviews published previously 
have touched on the topic of coronavirus main proteases 
[25-27]. In this review, recent advances in biological and 
structural studies, and particularly in development of in- 
hibitors of the CoV Ms, will be summarized and dis- 
cussed. 


© 2006 Bentham Science Publishers Ltd. 
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2. STRUCTURE AND FUNCTION 
VIRUS MPR° 


2.1. Overview 


OF CORONA- 


As the largest RNA viruses known to date, the size of 
coronavirus genomes range from 27 to 31 kb [5, 7, 8, 28- 
40]. About two-thirds of the genome encodes two overlap- 
ping polyproteins, ppla (450-500 kDa) and pplab (750-800 
kDa), which undergo extensive proteolytic processing by 
viral proteases to produce multiple functional subunits. 
These functional subunits are involved in formation of the 
replicase complex to carry out viral replication and tran- 
scription [25, 26, 41]. The viral proteases mentioned above 
are classified into accessory proteases and the main protease 
(MP®, also called 3C-like protease). Accessory proteases are 
papain-like cysteine proteases which cleave the N-proximal 
polyproteins regions at two or three limited sites [25, 26]. 
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The main protease is a chymotrypsin-like cysteine protease 
(~33 kDa) [42, 43], which not only processes at its own 
flanking sites within the polyproteins, but also directs the 
processing of all downstream domains of the replicase poly- 
proteins via at least 11 conserved cleavage sites [25, 26, 44] 
(see Fig. 1). It is termed the main protease because of its 
dominant role in processing replicase polyproteins and gene 
expression. The alternative name of 3C-like protease was 
designated after the picornavirus 3C proteases because of 
the similar substrate specificities and the identification of 
cysteine as a catalytic residue in the context of a predicted 
two-B-barrel structure [45, 46]. 


The existence of the coronavirus M?” was originally pre- 
dicted by sequence analysis of IBV replicase polyprotein in 
1989 [46]. Although comparative sequence analysis was 
subsequently extended to include the replicase genes of 
MHV, HCoV-229E and TGEV [36, 38, 39], the first ex- 
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Fig. (1). The domain organization and proteolytic processing of coronavirus replicase polyproteins from human coronavirus 229E (HCoV- 
229E), murine hepatitis virus (MHV), SARS coronavirus (SARS-CoV) and avian infectious bronchitis virus (IBV). Shown are the replicase 
polyproteins ppla and pplab; the processing products of ppla are termed non-structural proteins (nsp) 1-11, and those of pplab are termed 
nspl-nsp10 and nsp12-nsp16. Cleavage sites for the main protease (M"®) are marked in orange arrows; cleavage sites for the papain-like 
cysteine protease 1 (PL1""°) are marked in yellow arrows; and cleavage sites for the papain-like cysteine protease 2 (PL2"°) are marked in 
blue arrows. Key to abbreviations: Ac, acidic domain; PL/””’, papain-like cysteine protease 1; X, X domain with adenosine diphosphate- 
ribose 1"-phosphatase activity; SUD, SARS-CoV unique domain; PL2””’, papain-like cysteine protease 2; Y, Y domain containing a trans- 
membrane domain and a putative Cys/His-rich metal binding domain; TM/, TM2, TM3, transmembrane domains 1, 2, 3; M?’”’, main protease 
(or 3C-like protease); RdRp, RNA-dependent RNA polymerase; HEL, helicase; ExoN, 3'-to-5' exonuclease; XendoU, poly(U)-specific en- 
doribonuclease; MT, S-adenosylmethionine-dependent ribose 2'-O-methyltransferase; C/H, Cys/His-rich domains predicted to bind metal 
ions. IBV ppla and pplab do not possess a counterpart to nsp1 of other coronaviruses. The PL1""”° of IBV is crossed out to indicate that it is 
proteolytically inactive. Figure adapted from [25]. 
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perimental evidence of protease activity was reported for 
IBV in 1994 [47]. Afterwards, extensive activity studies of 
MP” in different expression systems were extended to MHV 
and HCoV [48, 49]. The information accumulated from 
different coronaviruses can be used to map the MP?” proc- 
essing of replicase polyproteins from all three coronavirus 
groups [25] (see Fig. 1). 


2.2. Overall Structure 


The crystal structure of TGEV M?” at 1.96 A, which was 
the first structure of any MP?" to be solved, was reported in 
2002 [43]. During the SARS outbreak and shortly after the 
SARS epidemic in 2003, the structures of HCoV 229E M?° 
and SARS-CoV M?" were respectively solved at 2.54 A and 
1.9 A [42, 50]. Later, several structures for SARS-CoV MP? 
were published one after the other with the aim of drug de- 
velopment [51-53]. TGEV and HCoV 229E both belong to 
group | [2]. SARS-CoV is considered to be an early split-off 
from group 2 and has been classified as a group 2b CoV [44, 
54]. HCoV 229E M?® and TGEV M?" share 61 % primary 
sequence identity, whereas SARS-CoV M?" has only about 
40 % sequence identity with these two MP's (see Fig. 2). 
Although the three M?"’s are from two different groups, they 
share a similar overall structure. It is interesting to find that 
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although SARS-CoV MP” shares a slightly higher sequence 
identity with TGEV M?” (44 %) than with the HCoV 229E 
enzyme (40 %), the structure is significantly more similar to 
that of the latter (r.m.s. deviations 2.3 A and 1.6 A, respec- 
tively), which explains why HCoV 229E M?"” is a better 
search model in determining the structure of SARS-CoV 
MP” by molecular replacement. 


All three main proteases are comprised of three domains 
(see Fig. 3A and B). Domains I (TGEV M?”: residues 8- 
100; HCoV MP: 8-99; SARS-CoV MP: 8-101) and II 
(TGEV MP": residues 101-183; HCoV MP?®: 100-183; 
SARS-CoV MP®: 102-184) both have an anti-parallel B- 
barrel fold, which is similar to the serine proteases of the 
chymotrypsin family. These two domains have very limited 
similarity with picornavirus 3C proteases [55-58], which 
also have a chymotrypsin-related structure, suggesting that 
the name main protease is more appropriate than the some- 
what misleading 3C-like protease. The superposition of 
domains I and II of the TGEV M”® onto the equivalent do- 
mains of the HAV 3C protease yields an r.m.s.d. of 2 A for 
114 equivalent (out of 184 compared) C, pairs [43]. The 
substrate-binding site is located in a cleft formed between 
domains I and II and the catalytic site lies at the center of the 
cleft. In contrast to picornavirus 3C proteases, there is an 
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Fig. (2). Multiple sequence alignment of coronavirus MPs ‘from human coronavirus 229E (HCoV-229E), murine hepatitis virus (MHV), 
SARS coronavirus (SARS-CoV) and avian infectious bronchitis virus IBV). The catalytic dyad (41-His and 144/145-Cys) is shown by blue 


triangles. 
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Fig. (3). A). The structure of the porcine transmissible gastroenteritis virus (TGEV) M"° monomer. The M”° monomer structure is drawn in 
ribbon representation and colored from blue at the N-terminal to red at the C-terminal. The catalytic dyad (41-His and 144-Cys) are shown as 
sticks and domains I, II and III are marked. B). Superposition of coronavirus M?"’s from human coronavirus 229E (HCoV-229E; blue), mur- 
ine hepatitis virus (MHV; yellow), SARS coronavirus (SARS-CoV; red) and avian infectious bronchitis virus (IBV; green). It should be 
noted that the refinement of IBV M?" is still in progress and the MHV MP” structure is a homology model. Please see [3] for further details. 


additional domain III in CoV MPs with a unique topology 
[43]. Domain III (TGEV M?®: residues 200-302; HCoV 
M?”: 200-300; SARS-CoV M?®: 201-303) contains five 
largely anti-parallel a-helices arranged into a globular clus- 
ter, and is connected with domain II via a long loop region 
consisting of 16 amino acids. The N-terminal within domain 
I folds onto domain III, bringing it into close proximity with 
the C-terminal of domain IIT. 


CoV M?® can form a homodimer both in the crystal and 
in solution (see Fig. 4). In the crystal, one protomer is ori- 
ented perpendicular to the other and the solvent accessible 
surface area (per protomer) buried upon, dimerization of 
each protomer ranges from 1300~1600 A*. In each pro- 
tomer, domains II, II and the “N-finger” are involved in 
dimer interface formation. The “N-finger’, defined as the N- 
terminal residues 1-7, inserts between domains II and III of 
the partner subunit [50]. Dynamic light scattering results 
show that both HCoV 229E and TGEV M?"s exist as a 
mixture of monomers (~65%) and dimers (~35%) in diluted 
solutions (1-2 mg/ml). Analytical gel filtration, ultracentri- 
fugation and isothermal titration calorimeter experiments 
both indicate that SARS-CoV MP® is also a mixture of 
monomers and dimers in solution [59-61], although the val- 
ues of the dissociation constant (Kq) determined by the dif- 


ferent methods are incongruent, ranging from WM to nM 
[51, 59, 62, 63]. While the N-terminal and domain III are 
observed to participate in dimerization interactions in the 
crystal structure, the role of the N-terminal in maintaining 
the quaternary structure has been reported with inconsistent 
results. In their SARS-CoV M?” study, Chou et al. showed 
that an ion pair formed by 4-Arg at the N-terminal of one 
protomer and 290-Glu at the C-terminal of domain III in its 
partner is crucial in dimer interactions. Mutating either of 
these two residues will increase the dissociation constant 
[62]. Another experiment demonstrated that deletion of 1-3 
amino acids from the N-terminal will not have a substantial 
effect on dimerization, whereas the truncation of amino 
acids 1-4 shows a major form of a monomer [63]. However, 
Shi et al. reported that after the dissection of SARS-CoV 
MP" into two parts, the chymotrypsin fold containing do- 
mains [| and II is a monomer while the additional domain III 
exists as a stable dimer [64]. Chen ef al. reported that K, of a 
N-terminal (1-7 amino acids) truncated SARS-CoV M?® 
changes slightly compared with the full-length protease by 
isothermal titration calorimeter analysis, suggesting that the 
N-terminal should not be indispensable for dimerization 
[61]. Nonetheless, it is generally accepted that domain III is 
essential to maintain quaternary structure and the “N-finger” 
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N-finger 


Fig. (4). The SARS-CoV M?” in complex with a CMK inhibitor, as reported by Yang et al. [50]. The M?” is shown in ribbon representation 
with protomer A in red and protomer B in blue, and is covered by a transparent molecular surface. The CMK inhibitors bound to protomers 


A and B are shown in yellow stick representation. 


is important for enzymatic activity, the latter of which will 
be discussed below. 


Another question is whether the role of domain III is 
only limited to stabilization of the dimer, thus maintaining 
the activity of the M?"”, as has been verified by loss of activ- 
ity following deletion of part or whole of domain III in 
TGEV and SARS-CoV MPs [43, 60, 63, 65]. In characteri- 
zation experiments of temperature-sensitive MHV mutants, 
substitution of 291-Phe in domain III of M?° with Leu 
causes an RNA-minus phenotype at the restrictive tempera- 
ture [66]. In the MHV MP?” homology model constructed 
from SARS-CoV MP?” [3], 291-Phe is located far from the 
active site and does not participate in dimerization, and so 
there is little probability of this mutation independently 
affecting the protease activity. As 291-Phe is situated near 
the protease surface, it has the opportunity to interact with 
other domains in polyproteins. One possible role for domain 
III might be to regulate the proleolytic processing pathways 
for the polyproteins through interaction of specific regions 
of downstream domains of the replicase polyproteins, al- 
though we can not exclude other functions of domain III in 
RNA synthesis. 


2.3. Catalytic Site and Substrate Binding Pocket 


It was originally proposed from IBV sequence analysis 
that the catalytic center of CoV M?” resembles that of other 
viral 3C and 3C-like proteases and might possibly include a 
His-Asp(Glu)-Cys catalytic triad [46]. Mutagenesis studies 
have confirmed that His and Cys residues are essential com- 


ponents of the catalytic centre in HCoV 229E, IBV, MHV 
and SARS-CoV M?"’s [48, 49, 51, 67-71]. There has been a 
long-stranding debate on the existence of a third member in 
the catalytic center of CoV Ms. Great efforts made in 
sequence analysis and mutagenesis studies could not com- 
pletely solve this question [26]. Only when the crystal 
structure of TGEV MP? was determined in 2002 was this 
problem finally clarified [43]. Compared with serine prote- 
ases and other cysteine proteases, which adopt a catalytic 
triad, CoV M?” only takes His-Cys as a catalytic dyad and 
completely lacks a third catalytic residue [3, 42, 43, 50]. In 
contrast to picornavirus 3C proteases, TGEV, HCoV 229E 
and SARS-CoV M?"°s all have a particularly ordered water 
molecule in the corresponding position to the third catalytic 
member. This water forms at least three hydrogen bond 
interactions with surrounding residues (including the cata- 
lytic dyad member His) in all M? structures determined to 
date, suggesting its possible role is to stabilize the proto- 
nated histidine in the intermediate state during proteolytic 
cleavage, reminiscent of the function of Asp (Glu) in classic 
serine protease catalysis. The replacement of cysteine in the 
charge-relay system with a serine in IBV and SARS-CoV 
will produce a protease with residual activity, supporting a 
classical general base mechanism related to serine proteases 
rather than a thiolate-imidazolium mechanism for papain- 
like proteases [69, 71]. 


Coronaviruses only encode one M?, which is highly 
selective for substrates. A typical substrate sequence for 
MP” is Leu-Glnl(Ser, Ala, Gly) in polyproteins, which is 
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conserved among different coronavirus Ms [26]. Hegyi et 
al. discovered that a synthetic peptide representing the N- 
terminal HCoV 229E MP” cleavage site was shown to be 
effectively hydrolysed by non-cognate M?"’s [72]. In studies 
on SARS-CoV M?®, it was reported that the N-terminal 
SARS-CoV M?® processing sequence could be efficiently 
cleaved by Ms of all other groups of CoV, including the 
recently identified HCoV-NL63 and HCoV-HKUI [3, 42, 
73]. This is explained by a comparison of representatives 
from all three groups of CoVs: the structures of TGEV 
(group 1), HCoV-229E (group 1), SARS-CoV (group 2b) 
MPs, a homology model of MHV (group 2a) M?” and the 
structure of IBV M””? under refinement [3, 27, 42, 43, 50]. 


Superposition of the structures and model reveal that 
backbones of the CoV M?"°s substrate binding pocket su- 
perimpose particularly well (see Fig. 3B), except for a small 
segment located on the outer wall of S2 [3], and so attention 
is focused on the variation of side chains forming important 
subsites S1, S2, S4 and S1'. In coronavirus M?"s, the S1 
subsite endows the peptidases with unique specificity for 
substrate recognition and P1 is invariably occupied by Gln 
in polyprotein processing. Anand and colleagues reported 
the structure of TGEV M?® in complex with a substrate 
analog, hexapeptidyl chloromethyl ketone inhibitor, Cbz- 
Val-Asn-Ser-Thr-Leu-Gln-CMK, derived from the N- 
terminal processing cleavage site [42]. This structure re- 
vealed that the side chains of 165-Glu, 162-His, 171-His, 
and 139-Phe in TGEV MP?” (also conserved in other M?"’s) 
are incorporated with other backbone elements to constitute 
the S1 site, which has an absolute requirement for Gln at the 
P1 position via two hydrogen bonds. One hydrogen bond is 
formed by the side chain of Gln and the Ne atom of His-162, 
which is situated at the bottom of S1 subsite. The required 
neutral state of His-162 at physiological pH appears to be 
maintained by two important interactions: (i) stacking with 
the phenyl ring of Phe-139 and (ii) acceptance of a hydrogen 
bond from the hydroxyl group of the buried Tyr-160, which 
is conserved in other known SARS-CoV and HCoV 229E 
MP?” structures [42, 50-52]. The role of His-162 and Tyr-160 
was experimentally verified by mutagenesis analysis of 
corresponding residues in HCoV 229E and FIPV MPs [68, 
74]. Inserting in between domain III of the parent protomer 
and domain II of the neighboring protomer in the dimer, the 
N-finger plays an important role for the protease activity. In 
particular, the first amino acid of the N-finger in the parent 
protomer commonly forms ionic or hydrogen bond interac- 
tions with the essential residues constituting S1 site of the 
partner protomer, helping to maintain the active conforma- 
tion of this subsite [43, 50, 52]. Deletion of residues 1-5 in 
the related TGEV MP?” renders the enzyme almost com- 
pletely inactive [43]. Site-direct mutation of the crucial resi- 
due or deletion of the whole N-finger will result in a great 
loss of activity in SARS-CoV MP?” [61-63, 65]. 


The side chains of 164-Leu, 51-Ile, 41-His, and 53-Tyr, 
as well as the alkyl portion of side chains of 186-Asp and 
47-Thr, are involved in forming a deep hydrophobic S2 
subsite that can accommodate the relatively large side chain 
of Leu in TGEV M?””. This same feature can also be ob- 
served in the HCoV-229E M?”. Several conservative sub- 
stitutions occur in other CoV M?"’s (164-Leu > 165-Met in 
SARS-CoV and MHV MP"s; 53-Tyr — 50-Trp in IBV 
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M?). Another minor difference is observed in SARS-CoV 
and MHV M?"’s, where the outer wall segment is composed 
of a short 3,9-helix from residues 45-50, in contrast to the 
less regular structure in HCoV and TGEV M?"s. With re- 
spect to the structure of IBV M°” undergoing refinement, no 
clear electron density was observed in the corresponding 
stretch of residues 44-47. Apart from a few exceptions, CoV 
M?’ has a Leu residue in the P2 position [26]. However, 
SARS-CoV M?° has Phe at the P2 site in its C-terminal 
cleavage sequence, representing a structural difference in the 
S2 site compared with TGEV and HCoV M?"s [51, 52]. The 
side chain of the residue at the P3 position in all Ms is 
solvent-exposed, so this site was expected to tolerate a wide 
range of functionality. The side chains of 164-Leu, 166-Leu, 
184-Tyr, and 191-Gln that form the S4 hydrophobic subsite 
of TGEV are conserved in other CoV M?"s, with the excep- 
tion of the following conservative substitutions: 184-Tyr > 
184-Phe in HCoV MP”; 164-Leu — 165-Met, 184-Tyr > 
185-Phe in SARS-CoV. The congested $4 site implies that 
only small amino acid residues such as Ser, Thr, Val or Pro 
can be accommodated at the P4 position [27]. Although the 
SI’ subsite of TGEV M?” formed by 27-Leu, 41-His and 
47-Thr is not deep, similar with other CoV Ms, it has 
sufficient space for common small P1’ residues such as Ser, 
Ala, or Gly to extend into this subsite. This could help in the 
design of substrate-analogue inhibitors targeting CoV M's 
with higher binding affinity [3]. Yang and colleagues have 
solved the structure of SARS-CoV M?® mutant in complex 
with an 1l-mer peptidyl substrate, which could provide 
whole insight into the interactions between substrate and 
protease (Rao, personal communication). In the first crystal 
structure of SARS-MP"° determined at pH 6.0, the S1 speci- 
ficity pocket of one protomer is in the active conformation 
while that of the neighboring protomer is partially collapsed, 
resulting in only about 50 % protease activity. Activity 
curves show that SARS-CoV protease activity increases 
from pH at 6.0 to approximately physiological pH [50, 53, 
59], and so a pH-triggered switch for the catalytic activity of 
the peptidase was proposed [50]. Tan and colleagues have 
reported similar results by structural analysis and molecular 
dynamics simulations [53]. The crystal structures solved by 
Lee and co-workers from crystals with different space 
groups at pH 6.5 showed that the substrate-binding regions 
of both protomers are in the catalytically competent confor- 
mation. Thus the authors proposed the possibility of an al- 
ternative or additional mechanism [52]. 


Although CoV MP” has similar substrate specificity with 
picornavirus 3C proteases as mentioned above, no reports 
have shown that any CoV M?" could efficiently process the 
substrate of picornavirus 3C proteases, or vice versa, which 
can be inferred from the differences substrate binding pock- 
ets between CoV MPs and picornavirus 3C proteases. 


It is generally agreed that M’° commonly processes the 
downstream cleavage sites in polyproteins in trans, which 
was characterized for M's of various species [25]; how- 
ever, it remains largely unknown whether the M”” releases 
itself from polyproteins in trans or in cis. In TGEV MP®° 
structures, it seems reasonable to suggest that the cleavage 
even occurs on an intermolecular basis, which has also been 
suggested for MHV M?”® based on biochemical evidence 
[75]. However, the previously reported structures of M?™ are 
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all in the mature form without any clues for the flanking 
hydrophobic regions (TM domains). A number of MHV and 
IBV precursors containing M"° were found to require mi- 
crosomal membranes for efficient autocleavage to release 
MP” [69, 76], suggesting that the autocleavage event might 
take place on the membrane and that hydrophobic regions 
help the cleavage sites to approach the active site by inter- 
action with the membranes. It is difficult to tell whether 
cleavage in cis will occur or not without further considera- 
tion of the interaction between TM domains and mem- 
branes. In the N-terminal autocleavage event, for instance, 
no matter whether the autocleavage is in cis or in trans 
within a dimer, domain I will undergo a large conforma- 
tional change between pre and post cleavage. Hence, the 
structure of M?° with flanking hydrophobic regions will 
help to address this question. Recently, Hsu et al. reported 
that they found in a dimeric structure of C145A mutant of 
SARS-M?", the active site of one protomer contains the C- 
terminal of one protomer in another asymmetric unit. There- 
fore, they proposed an autocleavage mechanism to explain 
this [51]. 


3. INHIBITOR DESIGN 
3.1. Enzyme Activity Assay 


In the development of CoV M?” inhibitors, efficacious 
methods are required to measure the enzyme activity and the 
effect of compounds screened. Initially the proteolytic ac- 
tivity of M"° was characterized mainly through processing 
the expressed polypeptide in vitro and analyzing the prod- 
ucts by electrophoresis [48, 49]. Later the HPLC method 
was utilized to analyze the cleavage of M””° on synthesized 
peptides [42, 72, 77] and determine the kinetic parameters 
for the proteases [59, 68]. Using an HPLC-based peptide 
cleavage assay, Fan and colleagues synthesized 34 peptide 
substrates and evaluated their specificity for SARS-CoV 
MP” by measuring hydrolytic activity [78]. However, these 
two methods are not sensitive and convenient enough for 
large-scale inhibitor screening. Afterwards a continuous 
colorimetirc assay was reported as a substitute for SARS- 
CoV M?® activity assay based on cleavage at the Gln-pNA 
bond for a substrate Thr-Ser-Ala-Val-Leu-Gln-pNA [79]. At 
present, a fluorescence resonance energy transfer (FRET) 
technique based on a fluorescence-labeled substrate has 
been developed and is routinely utilized to monitor the M”° 
activity in vitro for anti-CoV inhibitor screening and drug 
design [51, 52, 60-63, 73, 80-88]. Parea and co-workers. 
invented a genetic screen assay to monitor the SARS-CoV 
MP activity, in which the protease activity is associated 
with the proliferation of lambda phage in infected Escheri- 
chia coli cells [89]. 


3.2. Design of Inhibitors Targeting CoV M?° 


To date, the inhibitors reported for CoV MP?” have mostly 
been compounds targeting the active site of CoV M?®. Al- 
though inhibitors preventing M”° dimerization were sug- 
gested [27, 64], there have been no related reports. In the 
early stages of CoV M”” studies, a series of typical serine, 
cysteine, aspartic acid and metalloprotease inhibitors were 
screened for CoV MPs. Ziebuhr and colleagues reported 
that 3,4-dichloroisocoumarin, phenylmethylsulfony! fluoride 
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(PMSF), Pefaloc SC, tosyl lysyl chloromethyl ketone 
(TLCK), antipain and ZnCl, all function as HCoV 229E 
MP” inhibitors [68]. A cysteine protease inhibitor E64d was 
shown to prevent MHV RNA synthesis and virus replica- 
tion, possibly by inhibition of MHV M?” processing activity 
[75]. However, these inhibitors were assayed primarily for 
characterization of the CoV MP” active site and its role in 
viral RNA synthesis and replication, but not for pharmaceu- 
tical use due to their non-specificity. 


During the global SARS epidemic, the M?° was consid- 
ered to be an attractive target [42], which led to an upsurge 
in the development of anti-SARS-CoV MP” inhibitors. Since 
the structure of SARS-CoV M?” was not available at the 
height of the outbreak, a series of homology models were 
constructed from the TGEV M?® and HCoV 229E M?® for 
virtual screening of compounds suitable for binding into the 
substrate binding pocket [42, 90-93]. However, no further 
reports have shown that the compounds suggested above are 
effective in inhibiting SARS-M?” activity or preventing 
viral replication. In the wake of the SARS outbreak, the 
inhibitors were developed predominantly by three methods: 
(1) large-scale screening of structurally diverse small mole- 
cules from compound libraries or natural sources such as 
plant extracts; (2) virtual screening of compounds from 
chemical databases based on the SARS-CoV M?” structure; 
and (3) ab initio design of small molecules directly from the 
structure, which will be respectively discussed below. A 
series of representative inhibitors identified against SARS- 
CoV MP” is listed in Table 1. 


3.2.1. High-Throughput Screening 


The rapid, sensitive fluorescence resonance energy trans- 
fer (FRET) technique provides the possibility to simultane- 
ously screen a large number of compounds from available 
drugs, chemical libraries or natural plant extracts. Except for 
a few cases, most research groups have chosen to adopt this 
method. Hsu and co-workers reported that metal ions such 
as Hg**, Zn** and Cu** are SARS-CoV MP?” inhibitors, pos- 
sibly because they have the ability to coordinate to the cata- 
lytic cysteine in the protease active site [94]. From a com- 
pound library, the authors also found that phenylmercuric 
nitrate (see Table 1), thimerosal and hexachlorophene are 
effective inhibitors against SARS-CoV M?”. However, safety 
is the main concern of the metal ions and particularly the 
toxicity of compounds containing mercury. Although pre- 
liminary data showed that hexachlorophene can decrease the 
amount of viral spike protein in the Vero E6 cells infected 
by SARS-CoV below the cytotoxicity concentration, the 
effect of this compound needs to be fully characterized in 
the prevention of viral replication and cytotoxicity. Liu and 
colleagues reported similar results for hexachlorophene and 
its derivatives in enzymatic inhibition assays [88]. Another 
study by Wu and co-workers reported that a transition-state 
analog inhibitor of HIV protease is active against the SARS- 
CoV M?® (K=0.6 UM) [82]. In a large scale s creening, Kao 
and colleagues screened 50,240 small molecules, in which 
they identified 104 compounds with anti-SARS-CoV activ- 
ity. From these active compounds, the authors discovered 
one compound named MP576 that displays inhibition activ- 
ity against SARS-CoV MP?” with an ICs of 2.5 uM and an 
ECso of 7 UM in cell-based assays [81]. Similarly, Blanchard 
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Table 1. Inhibition of SARS CoV M?” Activity In Vitro and SARS-CoV in Cell Culture by Published Compounds 


Compound 


Phenylmercuric 
nitrate [94] 


Structure 


Enzymatic 
assay 


0.3 UM (K;) 


Antiviral activity 
in cell-based 
assay (uM) 


Hexachlorophene 
[88, 94] 


4-13.7 UM (K;) 


~10 (ECs0) 


Compound 2 [82] 


0.6 UM (K;) 


MP576 [81] 


2.5 UM (ICso) 


7 (ECs0) 


MAC-5776 [83] 


0.5 UM (ICso) 


Theaflavin-3,3'- 
digallate (TF3) [95] 


9.5 UM (ICso) 
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(Table 1) contd.... 


Antiviral activity 


E ti 
Me in cell-based 


Compound Structure 
assay 


assay (uM) 


Hesperetin [96] 60 UM (ICs0) 


Calmidazolium [79] 61 UM (Ki) 


Cinanserin [73] 4.92 UM (ICs) 31 Cs) 


Aza-peptide 18 uM (Ki) 
epoxides [52] : 35x10°S"! (k3) 


FL-166 [60] 0.04 UM (K;) 


8c [80] 0.6 UM (ICso) 
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(Table 1) contd.... 


Compound Structure 


Yang et al. 


Antiviral activity 
in cell-based 
assay (uM) 


Enzymatic 
assay 


0.03 UM (K;) 


0.95 UM (ICs) 


“N/A : data not available. 
° Unpublished data. 


and colleagues screened 50,000 small molecules and identi- 
fied 5 compounds exhibiting inhibitory activity (ICso= 0.5-7 
uM) towards SARS-CoV M?” [83]. In addition to small 
molecules from chemical libraries, some research groups 
have turned to natural plant extracts such as Chinese herbs. 
Two research groups individually claimed that tannic acid, 
3-isotheaflavin-3-gallate and theaflavin-3,3'-digallate from 
black tea and one plant-derived phenolic compound, hes- 
peretin, are SARS-CoV M?” inhibitors [95, 96]. 


3.2.2. Virtual Screening 


In a virtual screening of compounds from several chemi- 
cal databases, Liu and colleagues identified calmidazolium, 
an antagonist of calmodulin, as a SARS-CoV MP?” inhibitor 
with a K; of 61 uM from enzyme inhibition assays [79]. 
Chen and co-workers identified cinanserin, a serotonin an- 
tagonist, as an inhibitor of both SARS-CoV and HCoV- 
229E Ms from a database containing more than 8,000 
compounds by a docking approach [73]. Cinanserin has an 
ICs9 of about 5 uM against both of these two proteases in 
enzyme inhibition assays, and an ECs, ranging from 19 to 
34 uM in cell-based assays. The authors also reported that 
the enzyme was not completely inhibited at the maximum 
drug concentration, while the amount of virus was signifi- 
cantly reduced in cell culture. This incongruity was ex- 
plained as additional drug effects by the authors, but could 
arise if cinanserin acts as an irreversible inhibitor with a 


9 uM (Ki) 


3.1x10°S" (ks) oo 


small inactivation rate constant, and so could not completely 
inactivate the protease in a limited time during the enzyme 
activity assay. 


3.2.3. Ab Initio Inhibitor Design 


pro 


Many compounds designed directly from the M’” struc- 
ture are mechanism-based irreversible inhibitors, which 
could increase their inhibition effects. The inhibitor initially 
forms a reversible complex with the protease, which then 
undergoes a chemical step (nucleophilic attack by Cys) 
leading to the formation of a stable covalent bond. The 
evaluation of this series of inhibitors requires both the equi- 
librium-binding constant K; (designated as k/k,) and the 
inactivation rate constant for covalent bond formation k; 
(see Equation (1)) [3, 97]. 


E+ EI—*5E-1 (1) 


The first reported SARS-CoV MP” irreversible inhibitor 
was a substrate-analog chloromethyl ketone (CMK) inhibi- 
tor, Cbz-VNSTLQ-CMK [50]. The sequence of this sub- 
strate-analog was derived from residues P6—P1 of the N- 
terminal autoprocessing site of TGEV M?. However, the 
two protomers of SARS-CoV M?® each exhibited an unex- 
pected binding mode (see Fig. 4), possibly resulting from 
the comparatively weak binding of peptidyl elements de- 
rived from the substrate of TGEV M?” and from the highly 
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reactive electrophile CMK. This would suggest that nucleo- 
philic attack might have occurred before a stable non- 
covalently bound enzyme-inhibitor complex was formed [3]. 
It should be noted that the CMK compound can react with a 
variety of proteases, including cellular proteases, without 
specificity due to their high electrophilicity. Thus, the CMK 
compound is mainly utilized to explore the active sites of 
proteases but is not suitable for direct use in drug develop- 
ment. Recently, Lee and colleagues designed an aza-peptide 
epoxide inhibitor [52], Cbz-Leu-Phe-AGIn-EP-COOEt and 
solved the co-crystal structure in which the Cys145 S' atom 
of the protease forms a covalent bond with the epoxide C3 
atom of the inhibitor. As expected, this substrate analog 
binds to the substrate binding pocket of the protease in a 
normal mode, as seen in the TGEV M?® complex [42]. 
Bacha and co-workers identified a cluster of serine residues 
near the active site cavity which were susceptible to target- 
ing by compounds containing boronic acid. In a series of 
designed bifunctional aryl boronic acid compounds, a com- 
pound named FL-166 is effective at inhibiting the protease 
with a K; of 40 nM [60]. Jain and colleagues reported a se- 
ries of keto-glutamine analogues with a phthalhydraido 
group at the G-position, whose derivatives are known HAV 
3C inhibitors, could act as reversible inhibitors against 
SARS M?® with ICs9 ranging from 0.6 to 70 UM [80]. Shie 
and colleagues reported that an anilide derived from 2- 
chloro-4-nitroaniline, L-phenylalanine and 4-(dimethyl- 
amino) benzoic acid can reversibly inhibit SARS-CoV M?® 
with a K; of 30 nM [87]. Chen and co-workers reported that 
a series of isatin (2,3-dioxindole) derivatives, which are 
known covalent inhibitors against rhinovirus 3C protease, 
could inhibit SARS-CoV MP” activity [98]. Unfortunately, 
previous studies have also shown that isatin compounds 
displayed poor antiviral activity [97], which might limit 
their use in drug development against SARS-CoV. In a re- 
cent study, Yang and colleagues proposed a strategy for 
preventing infection by existing and possible future emerg- 
ing coronaviruses. After analyzing the critical subsites of 
M? substrate binding pocket from representatives of all 
three groups of CoVs MP’’s, the authors concluded that all of 
those subsites are highly conserved. A compound (desig- 
nated as N3) incorporating a trans-o., B-unsaturated ester 
with the peptidyl portion targeting this conservative region 
could rapidly inactivate multiple M's covering all three 
groups of CoVs in vitro, and shows potent antiviral activity 
with extremely low cytotoxicity. A uniform inhibition 
mechanism was elucidated from the structures of M?°- 
inhibitor complexes from SARS-CoV and TGEV. In the 
SARS-CoV MP?” complex with N3 (see Fig. 5), for instance, 
this compound binds to the shallow cleft formed by a por- 
tion of the strand elI and a segment of the loop linking do- 
mains II and III. The Cy, atom of the Michael acceptor forms 
a covalent bond with the S, atom of 145-Cys. The P1, P2, P4 
and PI' groups of the inhibitor insert favorably into thier 
corresponding subsites. It is of interest that this compound 
could also rapidly inactivate the M's of HCoV-NL63 and 
HCoV-HKU1, two recently identified HCoVs associated 
with bronchiolitis and pneumonia. The authors expect that 
further modification of these inhibitors could rapidly lead to 
the discovery of a single agent with clinical potential against 
existing and possible future emerging CoV-related diseases 


[3]. 
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4. OTHER THERAPEUTIC TARGETS IN THE SARS- 
COV 


Much of the focus in structure-based drug design target- 
ing the SARS coronavirus has so far been made on the MP”. 
However, a slew of recent SARS-CoV protein structures 
have opened new avenues towards the design of SARS anti- 
viral compounds, as well as for vaccine design. Of particular 
importance is the spike (S) glycoprotein, one of four struc- 
tural proteins that are required for viral assembly. The S 
protein is mainly responsible for binding to the host cell and 
for subsequent cell entry by virus-cell membrane fusion. 
Great attention is also being made to identify targets in- 
volved in the replication and transcription of the virus, 
which was mediated by machinery composed of so-called 
non-structural proteins. 


4.1. SARS Spike Protein Fusion Core 


The SARS-CoV S protein can be subdivided into an N- 
terminal half (S1) and C-terminal half (S2), but without 
proteolytic cleavage [99]. S1 is responsible for binding to 
cellular receptors, while. S2 contains an internal fusion pep- 
tide and has two hydrophobic (heptad) repeat regions desig- 
nated HR1 and HR2 [100]. 


The structure of the spike (S) protein fusion core con- 
sisting of HR1 and HR2 regions was first determined in 
2004 by two groups in the post-fusion (or fusion-active) 
state (Fig. 6) [99, 101]. Briefly, the structure of the S fusion 
core structure is a six-helix bundle in which three HR1 heli- 
ces form a central coiled-coil surrounded by three HR2 heli- 
ces in an oblique, anti-parallel manner. HR2 peptides pack 
into the hydrophobic grooves of the HR1 trimer in a mixed 
extended and helical conformation, representing a stable 
post-fusion structure, similar to that observed for HIV-1 
gp41 [102]. 


The classical mechanism of enveloped virus and host- 
cell membrane fusion mediated by class I fusion proteins 
was first established by Don Wiley and colleagues from 
comprehensive studies of the influenza hemagglutinin (HA) 
[102, 103]. In subsequent years, extensive structural studies 
on the orthomyxovirus, retrovirus, paramyxovirus, and 
filovirus families have yielded a common fusion mechanism 
[102]. To confirm the value of fusion proteins as antiviral 
targets, one HIV-1 membrane fusion-inhibitory peptide T-20 
(Trimeris, Research Triangle Park, NC, USA), targeting the 
prehairpin intermediate, was recently approved by the US 
Food and Drug Administration as a new anti-HIV drug 
[104]. In the case of SARS-CoV, several peptides derived 
from the HRI and HR2 regions of SARS-CoV spike pro- 
teins have been synthesized to block viral entry targeting the 
putative pre-hairpin intermediate [104-106]. Importantly, 
two groups have found that only peptides derived from HR2, 
and not from HRI, inhibited SARS-CoV infection [104, 
105]. Moreover, the efficacy of HR2 peptides derived from 
SARS-CoV spike protein is lower than those of correspond- 
ing HR2 peptides of murine coronavirus mouse hepatitis 
virus in inhibiting MHV infection [105]. One possibility 
might be the lower affinity of these peptides for the corre- 
sponding HRI trimer [105], since a larger surface area is 
buried in the HR1-HR2 interface of MHV S2 than in SARS- 
CoV S2 [99]. In any case, elucidation of the HR1-HR2 
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Fig. (5). A). Surface representation of SARS-CoV M?” (blue, with residues shown in yellow) complexed with the N3 inhibitor (green) re- 
ported by Yang etal. [3]. Water molecules are shown as red spheres. The P1, P2, P3, P4, PS and P1' groups and residues forming the sub- 
strate binding pocket are labeled. B). Detailed view of the interactions between SARS-CoV M?” and the N3 inhibitor reported by Yang et al. 
[3]. The N3 inhibitor is shown in blue. Hydrogen bonds are shown as dashed lines and interaction distances are given. The covalent bond is 
labeled in red. 
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fusion core structure should prove beneficial for the discov- 
ery of viral entry inhibitors against SARS. 


4.2. SARS-CoV Spike Protein Receptor Binding Domain 
in Complex with the Cellular Receptor ACE2 


As previously described, the S1 region within the spike 
protein features the receptor binding site while the $2 region 
mediates the fusion activity. One potential SARS-CoV re- 
ceptor has been identified as angiotensin-converting enzyme 
2 (ACE2) [107], and a fragment of the S1 region has been 
shown to be required for tight binding to the peptidase do- 
main of ACE2 [108-110]. It is this fragment, termed the 
receptor binding domain (RBD), which mediates the virus- 
receptor interaction and thus determines the viral host range 
and tropism. Changes in just a few residues in the RBD can 
result in efficient cross-species transmission [111, 112]. 
Furthermore, the RBD also includes important viral-neutral- 
izing epitopes [113-116], suggesting that variants of the 
RBD could be used for the development of protein-based 
vaccines. The recent structure of the RBD in complex with 
the cellular receptor ACE2, determined by Stephen Harrison 
and colleagues [117], should therefore provide an important 
means for the design of effective coronavirus vaccines. 
From the complex structure, the authors observed tight 
complementarity between the shallow concave surface of 
the S protein RBD and one lobe of the human ACE2 pepti- 
dase domain (Fig. 6) and identified residues critical residues 
for species specificity. 


4.3. Non-Structural or Replicase Proteins 


The SARS-CoV replicase gene encodes 16 non-struc- 
tural proteins (nsps) with multiple enzymatic functions, 
which are known or are predicted to include types of en- 
zymes that are common components of the replication ma- 
chinery of plus-strand RNA viruses (see [25] for a review). 
A number of recent structures of replicase proteins, both 
alone and in protein-protein complexes, provide insights into 
the sophisticated function and assembly of the replica- 
tion/transcription machinery. Targeting and inhibiting the 
function of this machinery should be another important fo- 
cus of structure-based drug design. However, little remains 
known about the coronavirus replication/transcription ma- 
chinery, particularly the interactions between the various 
non-structural protein components, and further work is 
needed to identify viable targets. 


Nsp9, the crystal structures of which were determined 
simultaneously in 2004 by two groups [118, 119], have as- 
certained its previously unknown function as a dimeric sin- 
gle-stranded RNA binding protein (Fig. 6). In addition to its 
structure, nsp9 has been shown to interact with nsp8, while 
dual-labeling studies of SARS-CoV replicase proteins have 
demonstrated co-localization of nsp8 with nsp2 and nsp3. 
Zihe Rao and colleagues have determined the first crystal 
structure of a complex between two non-structural proteins, 
nsp7 and nsp8 (Fig. 6) [120]. Eight copies of nsp7 and eight 
copies of nsp8 together form an intricate scaffold that re- 
sembles a hollow cylinder. The inner dimensions and elec- 
trostatic properties of the cylinder suggest that it should 
encircle nucleic acid, and an interaction was demonstrated 
with dsRNA by EMSA and mutagenesis. The architecture 
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and electrostatic properties are reminiscent of PCNA or the 
B-subunit ring, the processivity factors of DNA polymerase, 
suggesting that the nsp7-nsp8 complex should be a proces- 
sivity factor for the RNA-dependent RNA polymerase 
(nsp12). Finally, the crystal structure of the ADRP domain 
of nsp3 has also been determined [121]. 


4.4. Other Targets for Anti-SARS Drug and Vaccine 
Design 


Other targets for anti-viral therapy and vaccine design 
have been identified in the SARS-CoV by structural biol- 
ogy. For instance, antigenic peptides of the coronavirus 
nucleocapsid (N) protein can be recognized on the surface of 
infected cells by T cells [122, 123]. The structure of the 
MHC-I molecule HLA-A*1101 in complex with such a 
peptide derived from the SARS-CoV N protein, a nonamer 
with SARS specific sequence, could be used as a template 
for peptide-based vaccine design [124]. DNA vaccines tar- 
geting the SARS-CoV nucleocapsid have been generated by 
two different groups [21, 23]. 


The crystal structure of the rigorously conserved stem- 
loop II motif (s2m) RNA element from SARS-CoV was 
determined earlier this year [125]. The unusual structural 
features form likely surfaces for interaction with conserved 
host cell components or other reactive sites required for 
virus function. This, together with the high sequence con- 
servation of s2m in an otherwise rapidly mutable RNA ge- 
nome, implies its pathogenic importance and signals that it 
could be another attractive target for the design of antiviral 
therapeutics. 


5. CONCLUSIONS 


Prior to the global SARS outbreak in 2003, CoVs had 
not attracted enough attention from researchers because this 
genus of viruses causes severe diseases dominantly in ani- 
mals and only comparatively mild diseases in humans. The 
SARS epidemic led to extensive studies of this etiological 
agent, the pathogenesis of SARS, and vaccine and drug 
development. The CoV MP”, spike protein, helicase, RNA- 
dependent RNA polymerase (RdRp) and possibly other viral 
(or cellular) proteins involved in pathogenesis are consid- 
ered to be targets for drug design. Among them, the M?” has 
been characterized the most in terms of structure and bio- 
chemistry. Despite this, there are several open questions: (1) 
How does the M?” release itself from the replicase polypro- 
teins, in cis or in trans? (2) Do alternative proteolytic proc- 
essing pathways exist for the replicase polyproteins in 
CoVs, as in equine arteritis virus? If so, how does the M”° 
mediate the pathways? (3) Does the M®® involve in patho- 
genesis in CoV-associated diseases? To answer these ques- 
tions requires further structural and biochemical studies. 


Evidence suggests that CoVs may have completed at 
least two animal-to-human interspecies transmissions to date 
[5, 6, 12, 31], resulting in the emergence of HCoV 229E and 
SARS-CoV. Although SARS has been contained, the dis- 
covery of the natural host for SARS-like CoV has aroused 
new fears for public health. CoVs, especially those that can 
infect hosts such as domestic animals and pets in frequent 
contact with humans, remain a potential threat to human 
health assuming they cross the interspecies barrier again. 
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Fig. (6). A). The S protein fusion core “‘six-helix bundle” structures from Supekar et al. (left, [99]) and Xu eral. (right, [101]). The central 
HR1 peptides are shown in ribbon representation and colored red, blue and green. The HR2 peptides are shown in black. The N- and C- ter- 
mini are labeled. B). The SARS-CoV S protein receptor binding domain (RBD) in complex with the cellular receptor ACE2 [117]. The com- 
plex structure of ACE2 (in green) and the S protein RBD (core structure in cyan and the receptor binding motif (RBM) in red) is shown in 
ribbon representation and covered by a transparent surface. C). The dimer structure of nsp9, a single stranded RNA binding protein [118, 
119]. Each monomer is shown in ribbon representation and colored from blue at the N-terminal to red at the C-terminal. D). The structure of 
the nsp7-nsp8 hexadecameric complex. The complex structure is shown in ribbon representation and covered by a transparent surface. Nsp7 
is shown in green and the two conformations of nsp8 are shown in blue (I) and orange (ID). 


Several key factors controlling the host spectrum and viral 
pathogenicity are highly variable among CoVs, including 
the requirement of different host receptors for cellular entry, 
poorly conserved structural proteins (antigens), and diverse 
accessory genes in their 3'-terminal genome regions that 
probably contribute to the pathogenicity of CoVs in specific 
hosts [2, 8, 28, 29, 107, 126-128]. This structural and func- 


tional diversity presents a significant obstacle for the design 
of a versatile compound against all CoVs. For instance, a 
fusion peptide inhibitor derived from the MHV spike protein 
can not prevent SARS-CoV replication in cell culture [105]. 
Identification of the CoV M®” as a conserved target among 
all CoVs will provide an opportunity for the development of 
broad-spectrum inhibitors against all CoV-related diseases. 
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The development of inhibitors targeting SARS-CoV M?° 
has produced a number of positive results. Some compounds 
exhibit inhibitory effects on both the protease and antiviral 
activity. A series of inhibitors whose backbone incorporates 
a trans-o, B-unsaturated ester with a peptidyl portion seem 
particularly promising, and one of their analogues, ruprin- 
trivir, has entered clinical trials against rhinovirus infection 
[97]. Further modification of these compounds should be 
expected to rapidly lead to the discovery of a single agent 
with clinical potential against existing and possible future 
emerging CoV-related diseases. 
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